Using Graph Transformation Techniques for Integrating Information from the WWW

Author

  • Lukas C. Faulstich
Abstract

The advent of the WWW has led to an abundance of information that is distributed across numerous Web sites of heterogeneous structure and coverage. It is therefore important to extract, combine and restructure these distributed data in order to facilitate access to them. The HyperView methodology models Web sites and their HTML pages as graphs from which information is extracted by a hierarchy of views. We present the clustered graph data model (CGDM) used in the HyperView system. Views are defined using graph transformation rules. We use typed, attributed single-pushout graph transformation with application conditions on attributes. The main contribution of this paper is a new rule activation mechanism that supports the incremental computation of views, a crucial requirement in the context of information extraction from Web sites. The HyperView prototype is currently used in the field of digital libraries. In this paper, we demonstrate our methodology in the domain of town information.
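
To make the idea of a graph transformation rule with an application condition on attributes more concrete, here is a minimal sketch in Python. It is not the HyperView implementation or the CGDM: the `Node` and `Graph` classes, the `apply_town_rule` function, the `Page`, `Row` and `Town` node types, the example URL and the town data are all hypothetical and chosen only for illustration. The sketch also leaves out clustering and the incremental rule activation mechanism described in the abstract.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    id: int
    type: str
    attrs: dict = field(default_factory=dict)


@dataclass
class Graph:
    nodes: dict = field(default_factory=dict)  # node id -> Node
    edges: set = field(default_factory=set)    # (source id, label, target id)

    def add_node(self, node_type, **attrs):
        node = Node(len(self.nodes), node_type, attrs)
        self.nodes[node.id] = node
        return node

    def add_edge(self, src, label, dst):
        self.edges.add((src.id, label, dst.id))


def apply_town_rule(page_graph, view, min_population):
    """Hypothetical rule: Page -row-> Row with a population condition yields a Town."""
    created = []
    for src, label, dst in sorted(page_graph.edges):
        page, row = page_graph.nodes[src], page_graph.nodes[dst]
        # Left-hand side: the typed pattern Page -row-> Row must match.
        if (page.type, label, row.type) != ("Page", "row", "Row"):
            continue
        # Application condition on an attribute of the matched Row node.
        if row.attrs["population"] < min_population:
            continue
        # Right-hand side: add a Town node to the view graph.
        created.append(
            view.add_node("Town", name=row.attrs["name"], source=page.attrs["url"])
        )
    return created


# Made-up page graph: one page with two table rows.
page_graph = Graph()
page = page_graph.add_node("Page", url="http://example.org/towns.html")
for name, population in [("Alpha", 1200), ("Beta", 90000)]:
    row = page_graph.add_node("Row", name=name, population=population)
    page_graph.add_edge(page, "row", row)

view = Graph()
towns = apply_town_rule(page_graph, view, min_population=50000)
print([t.attrs["name"] for t in towns])
```

In this sketch the view is a separate graph that receives one `Town` node per match, so the page graph itself is never modified. Only the row that satisfies the attribute condition contributes a node.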

Similar resources

A Hybrid Meta-heuristic Approach to Cope with State Space Explosion in Model Checking Technique for Deadlock Freeness

Model checking is an automatic technique for software verification in which all reachable states are generated from an initial state in order to find errors and desirable patterns. In the model checking approach, the behavior and structure of the system should be modeled. A graph transformation system is a graphical formal modeling language used to specify and model the system. However, modeling of large s...

LPKP: location-based probabilistic key pre-distribution scheme for large-scale wireless sensor networks using graph coloring

Communication security in wireless sensor networks is achieved using cryptographic keys assigned to the nodes. Due to resource constraints in such networks, random key pre-distribution schemes are of high interest. Although most of these schemes do not consider location information, there are scenarios in which nodes can obtain location information after their deployment. In this paper,...

Pii: S0098-3004(99)00073-4

A tool called GeoVR has been designed and developed under a client/server architecture to enable the interactive creation of a 3D scene and virtual reality modeling language (VRML) model from 2D spatial data by integrating Internet geographical information system (GIS) and HTML programming. The client front-end of this tool provides an HTML form to set properties for building 3D scenes, while t...

Modeling and Navigation of Large Information Spaces: A Semantics based Approach

In this paper we present techniques for modeling the semantics of large information spaces and for navigating them. This information space represents heterogeneous data stored in different formats and distributed across multiple locations on the Internet. We also describe a prototype system called SEMQUEST (SEMantics based QUEry SysTem) that employs graph-based algorithms and allows users to in...

Master Theses Proposal Applications of Graph Transformation Systems to the specification of web applications

The World-Wide Web (WWW, W3) is a well-known distributed information system that grew during the nineties and is presently the most common way to publish information on the Internet (see [W3]). Despite its tremendous expansion and impact on the computer industry, the basic WWW data model is still very poor (namely a flat collection of documents called pages, with references between th...

Publication date: 1998